Process Discovery with Precedence Constraints

نویسندگان

  • Gianluigi Greco
  • Antonella Guzzo
  • Luigi Pontieri
چکیده

The automatic discovery of a process model out of a historical log traces can be of great value for both analysis and design tasks, and is a major goal of Process Mining approaches. A key step in discovering such a model consists in detecting a graph of causal/precedence dependencies over process activities – which can be possibly used to eventually derive more expressive control-flow (e.g., Petri-net based) models. To this end, most of current approaches exploit greedy heuristics and discard apriori some dependencies assuming that the given log is complete (i.e., it covers the possible behavior of the process) – so risking to be ineffective when working with high-concurrency processes and with noisy and temporally-biased logs. Despite the usage of a-priori knowledge could improve the effectiveness, efficiency and robustness (w.r.t. incomplete/noisy data) of dependency mining algorithms, it has been given little attention so far. This paper fills the gap by proposing a constraint-based process discovery framework, where background knowledge can be encoded via precedence constraints, while the search of dependencies can be encoded as either a constraints satisfaction or a constraints satisfaction optimization problem. The computational complexity of such problems is studied deeply, and their tractability frontier is traced w.r.t. the different classes of constraints involved. The whole approach has been implemented in a prototype system, leveraging on a solid constraint programming platform, and tested on both synthesized and real log data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SINGLE MACHINE DUE DATE ASSIGNMENT SCHEDULING PROBLEM WITH PRECEDENCE CONSTRAINTS AND CONTROLLABLE PROCESSING TIMES IN FUZZY ENVIRONMENT

In this paper, a due date assignment scheduling problem with precedence constraints and controllable processing times in uncertain environment is investigated, in which the basic processing time of each job is assumed to be the symmetric trapezoidal fuzzy number, and the linear resource consumption function is used.The objective is to minimize the crisp possibilistic mean (or expected) value of...

متن کامل

Single Machine Scheduling Problem with Precedence Constraints and Deteriorating Jobs

This paper considers the single machine scheduling problem with precedence constraints and deteriorating jobs. A mathematical model based on binary integer programming (BIP) is developed. By these precedence constraints, jobs can not start before completion of its all predecessors. The represented model is in two steps, in step one the earliest starting time of each job is computed, then the re...

متن کامل

Multi-Objective Unrelated Parallel Machines Scheduling with Sequence-Dependent Setup Times and Precedence Constraints

This paper presents a novel, multi-objective model of a parallel machines scheduling problem that minimizes the number of tardy jobs and total completion time of all jobs. In this model, machines are considered as unrelated parallel units with different speeds. In addition, there is some precedence, relating the jobs with non-identical due dates and their ready times. Sequence-dependent setup t...

متن کامل

A Process Discovery under Precedence Constraints

Process discovery has emerged as a powerful approach to support the analysis and the design of complex processes. It consists of analyzing a set of traces registering the sequence of tasks performed along several enactments of a transactional system, in order to build a process model that can explain all the episodes recorded over them. An approach to accomplish this task is presented which can...

متن کامل

Fault Management in Communication Networks: Test Scheduling with a Risk-Sensitive Criterion and Precedence Constraints

We consider the problem of determining the optimal sequence of tests for the discovery of a faulty component, e.g., in a telecommunications network, where there is a random cost associated with testing a component. A novel feature in our approach is that a risksensitive performance criterion is used in order to rank different competing schedules. We characterize optimal schedules both when the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011